PyDigger - unearthing stuff about Python


Name Version Summary Date
kedro-viz 10.1.0 Kedro-Viz helps visualise Kedro data and analytics pipelines 2024-11-21 20:16:56
dbt_coves 1.8.12 CLI tool for dbt users adopting analytics engineering best practices. 2024-10-24 20:29:14
DeepCoreML 0.4.0 A collection of Machine Learning techniques for data management, engineering and augmentation. 2024-09-30 08:53:03
aws-json-dataset 0.1.0 Send JSON datasets to various AWS services. 2024-02-03 22:56:45
scistag 0.9.0 A stack of helpful libraries & applications for the rapid development of data driven solutions. 2024-01-15 22:38:53
dcw 0.0.10 2024-01-15 21:15:25
datasaurus 0.0.2.dev4 Data Engineering framework based on Polars.rs 2023-12-19 12:10:51
deepCoreML 0.1 A collection of Machine Learning techniques for data management and augmentation. 2023-11-29 22:13:22
pipelinex 0.7.9 PipelineX: Python package to build ML pipelines for experimentation with Kedro, MLflow, and more 2023-11-28 12:52:31
dtflw 0.6.7 dtflw is a Python framework for building modular data pipelines based on Databricks dbutils.notebook API. 2023-10-29 13:28:29
wiz-craft 1.1.1 A CLI-based dataset preprocessing tool for machine learning tasks. Features include data exploration, null value handling, one-hot encoding, and feature scaling, and download the modified dataset effortlessly. 2023-10-18 08:32:14
pycurie 0.1.16 2023-10-17 16:56:22
ParallelFileConcatenator 0.1 ParallelFileConcatenator is a robust tool designed to efficiently combine data files of various formats (CSV, Feather, Parquet, XLSX, XLS) from a specified directory. 2023-08-22 09:34:18
dbt-coves 1.6.0 CLI tool for dbt users adopting analytics engineering best practices. 2023-08-10 18:57:39
dsutils-ms 1.10 My Data Science Utils 2023-07-19 16:16:44
chartstag 0.8.2 Charting and diagram extension for SciStag 2023-06-15 21:28:04
flowrunner 0.2.3 Flowrunner is a lightweight package to organize and represent Data Engineering/Science workflows 2023-06-08 15:59:53
duckingit 0.0.11 A framework to leverage clusters of serverless functions for analytics. Powered by DuckDB 2023-05-23 19:26:49
adcpipeline 0.2.1 A pipeline for a structured way of working 2023-04-24 11:35:44
dataset-shuffler 0.1.1 Data engineering tool for learning-based computer vision. 2023-01-28 23:21:55